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A method is provided for the selection of diagnostic genetic markers fragments and useful in the identification of bacteria at the 
genus, species or serotype level. The method first involves the identification of a RAPD polymorphic DNA fragment common to a particular 
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the presence of a marker within the fragment whereby that set of primers is then useful in the identification of all members of the chosen 
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Use of a specific marker for detection of Salmonella 
with PCR. 



5 f TFT.D OF TTWENT TON 

The invention relates to the field of molecular 
biology and the use of randomly amplified nucleic acid 
fragments for the selection of genetic markers useful in 
the identification of bacteria at the genus, species or 
10 serotype level. This invention further relates to a 

specific DNA marker sequence useful for the detection of 
Salmonella, and use of that diagnostic marker to 
determine if an unknown bacterium is a member of the 
genus Salmonella . 

15 BACKGROUND 

An integral aspect of the field of microbiology is 
the ability to positively identify microorganisms at the 
level of genus, species or serotype. Correct 
identification is not only an essential tool in the 

20 laboratory but plays a significant role in the control 
of microbial contamination in the processing of food 
stuffs, production of agricultural products and 
monitoring of environmental media such as ground water. 
Increasing stringency in regulations which apply to 

25 microbial contamination have resulted in a corresponding 
increase in industry resources which must be dedicated 
to contamination monitoring. 

Of greatest concern is the detection and control of 
pathogenic microorganisms . Although a broad range of 

30 microorganisms have been classified as pathogenic, 
attention has primarily focused on a few bacterial 
groupings such as Escherichia, Salmonella, Listeria and 
Clostridia. Typically, pathogen identification has 
relied on methods for distinguishing phenotypic aspects 

35 such as growth or motility characteristics, and 
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immunological and serological characteristics. 
Selective growth procedures and immunological methods 
are the traditional methods of choice for bacterial 
identification, and can be effective for the presumptive 
5 detection of a large number of species within a 

particular genus. However, these methods are time 
consuming, and are subject to error. Selective growth 
methods require culturing and subculturing in selective 
media, followed by subjective analysis by an experienced 

10 investigator. Immunological detection (e.g., ELISA) is 
more rapid and specific, however it still requires 
growth of a significant population of organisms and 
isolation of the relevant antigens. For these reasons 
interest has turned to detection of bacterial pathogens 

15 on the basis of nucleic acid sequence. 

It is well known, for example, that nucleic acid 
sequences associated with the ribosomes of bacteria are 
often highly conserved across genera and are therefore 
useful for identification (Webster, U.S. 4,717,653 and 

20 U.S. 5,087,558; Enns, Russel K. Lab. Wed., 19, 295, 

(1998); Mordarski, M. Soc. Appl. Bacterid. Tech. Ser., 
20 (Chem. Methods Bact . Syst.), 41, (1985)). Weisburg 
et al., (EP 5173 6) disclose a method for the detection 
and identification of pathogenic microorganisms 

25 involving the PCR amplification and labeling of a target 
nucleotide for hybridization to 16S rDNA of E. coll. and 
Lane et al . , (WO 9015157) teach universal nucleic acid 
probes that hybridize to conserved regions of 23S or 16S 
rRNA of eubacteria. 

30 Although bacterial ribosomal nucleic acids contain 

highly conserved sequences, they are not the only 
sources of base sequence conservation that is useful for 
microorganism identification. Wheatcroft et al., 
(CA 2055302) describe the selection of transposable 

35 elements, flanked by unique DNA sequences for the 
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detection of various Rhlzoblum strains. Similarly 
Tommassen et al., (WO 9011370) disclose polynucleotide, 
probes and methods for the identification and detection 
of gram-positive bacteria. The method of Tommassen et 
5 al., relies on probes corresponding to relatively short 
fragments of the outer membrane protein OrapA, known to 
be highly conserved throughout gram-positive genera. 
Atlas et al., (EP 517154) teach a nucleic acid 
hybridization method for the detection of Giardla sp. 
10 based on designing probes with sequences complementary 
to regions of the gene encoding the giardin protein. 
Webster, J. A., (U.S. 4717653) has expanded upon the use 
of rRNA in disclosing a method for the characterization 
of bacteria based on the comparison of the 
15 chromatographic pattern of restriction endonuclease- 

digested DNA from the unknown organism with equivalent 
chromatographic patterns of at least 2 known different 
organism species. The digested DNA has been hybridized 
or reassociated with ribosomal RNA information- 
20 containing nucleic acid from, or derived from a known 
probe organism. The method, of Webster et al., 
effectively establishes a unique bacterial nucleic acid 
"fingerprint" corresponding to a particular bacterial 
genus against which unknown "fingerprints" are compared. 
25 The methods described above are useful for the 

detection of bacteria but each relies upon knowledge of 
a gene, protein, or other specific sequence known a 
priori to be highly conserved throughout a specific 
bacterial group. An alternative method would involve a 
30 nontargeted analysis of bacterial genomic DNA for 

specific non-phenotypic genetic markers common to all 
species of that bacteria. For example, genetic markers 
based on single point mutations may be detected by 
differentiating DNA banding patterns from restriction 
35 enzyme analysis. As restriction enzymes cut DNA at 
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specific sequences, a point mutation within this- site 
results in the loss or. gain of a recognition site, 
giving rise in that region to restriction fragments of 
different length. Mutations caused by the insertion, 
5 deletion or inversion of DNA stretches will also lead to 
a length variation of DNA restriction fragments. 
Genomic restriction fragments of different lengths 
between genotypes can be detected on Southern blots 
(Southern, E. M., J. Mol . Biol. 98, 503, (1975). The 

10 genomic DNA is typically digested with any restriction 
enzyme of choice, the fragments are elect rophoretically 
separated, and then hybridized against a suitably 
labelled probe for detection. The sequence variation 
detected by this method is known as restriction length 

15 polymorphism or RFLP (Botstein et al. Am. J. Bum. Genet., 
342, 314, (1980)). RFLP genetic markers are 
particularly useful in detecting genetic variation in 
phenotypically silent mutations and serve as highly 
accurate diagnostic tools. 

20 Another method of identifying genetic polymorphic 

markers employs DNA amplification using short primers of 
arbitrary sequence. These primers have been termed 
•random amplified polymorphic DNA', or "RAPD" primers, 
Williams et al;, Nucl. Acids. Kes., 18, 6531 (1990) and 

25 U.S. 5,126,239; (also EP 0 543 484 A2, WO 92/07095, 

WO 92/07948, WO 92/14844, and WO 92/03567). The RAPD 
method amplifies either double or single stranded 
nontargeted, arbitrary DNA sequences using standard 
amplification buffers, dATP, dCTP, dGTP and TTP and a 

30 thermostable DNA polymerase such as Tag. The nucleotide 
sequence of the primers is typically about 9 to 13 bases 
in length, between 50 and 80% G+C in composition and 
contains no palindromic sequences . RAPD detection of 
genetic polymorphisms represents an advance over RFLP in 

35 that it is less time consuming, more informative, and 
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readily susceptible to automation. Because of its 
sensitivity for the detection of polymorphisms RAPD 
analysis and variations based on RAPD/PCR methods have 
become the methods of choice for analyzing genetic 
5 variation within species or closely related genera, both 
in the animal and plant kingdoms. For example, Landry 
et al., {Genome, 36, 580, (1993)) discuss the use of 
RAPD analysis to distinguish various species of minute 
parasitic wasps which are not morphologically distinct. 
10 Van Belkum et al . , (Wol. Biochem Parasitol 61, 69, 

(1993) ) teach the use of PCR-RAPD for the distinction of 
various species of Giardi. 

In commonly assigned application USSN 07/990,297, 
Applicants disclose a method of double-nested PCR which 
15 is used to detect the presence of a specific microbe. 
This disclosure first describes identifying a random 
unique segment of DNA for each individual microorganism 
which will be diagnostic for that microorganism. To 
identify and obtain this diagnostic nucleic acid segment 
20 a series of polymorphic markers is generated from each 
organism of interest using single primer RAPD analysis. 
The RAPD series from each organism is compared to 
similarly generated RAPD series for other organisms, and 
a RAPD marker unique to all members of the group is then 
25 selected. The unique marker is then isolated, amplified 
and sequenced. Outer primers and inner primers suitable 
for double-nested PCR of each marker may then be 
developed. These primers comprise sequence segments 
within the RAPD markers, wherein the inner set of 
30 primers will be complementary to the 3' ends of the 

target piece of nucleic acid. These nested primers may 
then be used for nested PCR amplification to definitely 
detect the presence of a specific microorganism. 
In the present method Applicants have more 
35 particularly adapted and more fully described this RAPD 
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methodology to identify a sequence, or marker; the 
presence of which will be diagnostic for all individuals 
of a genetically related population. The present method 
first involves a RAPD amplification of genomic DNA of a 
5 representative number of individuals within a specific 
genus, species or subspecies to produce a RAPD 
amplification product, termed the diagnostic fragment. 
This diagnostic fragment must be present in the RAPD 
profiles in over 90% of the individuals tested. 

10 Sequence information from the diagnostic fragment will 
then enable identification of the most suitable PCR 
primer binding sites within the diagnostic fragment to 
define a unique diagnostic marker. Primers flanking 
this marker will be useful to produce an amplification 

15 product in the genetically selected group, but will not 
produce any amplification product in individuals outside 
of that group. 

An important aspect of the present invention is the 
identification of the most conserved primer binding 

20 sites within this diagnostic sequence., which is 

accomplished by first determining which individuals, in 
the genus or grouping to be detected, exhibit the most 
genetic variation within the diagnostic sequence. 
Screening this subpopulation of "most polymorphic" 

25 individuals using various primers generated from the 
diagnostic sequence will define the most highly 
conserved primer bindings sites within the diagnostic 
fragment. Primers directed toward these highly 
conserved primer binding sites are then useful for the 

30 detection of all members of the genus, based upon the 
ability of the selected primers to amplify the 
diagnostic marker present in that particular population. 
A "yes" or "no" answer can then be readily provided to 
the question of whether a microorganism is a member of 
.35 the genetically related population. If DNA 



WO 95/33854 



PCT/US95/06704 



amplification occurs using these primers, the target is 
present and the identity is confirmed as "yes". If 
amplification does not occur, the answer is no; the 
microorganism is not a member of that genetically 
5 related population. The necessity of electrophoresis to 
determine the presence of a marker of any particular 
size is eliminated. 

Applicants ' method is distinctive in that to 
accomplish detection of a member of a group of 

10 organisms, the. method first relies on determining the 
most conserved regions of a diagnostic fragment from a 
phenotypically uncharacterized segment of DNA common to 
all members of that group. One of skill in the art will 
recognize that conservation of sequence may represent 

15 both an ally and an enemy in the process of 

identification of the members of a particular genus. 
For example, many bacterial sequences are conserved 
across genera and hence would not be useful in the 
determination of species within a particular genus. It 

20 is precisely for that reason that methods heretofore 

elucidated in that art rely primarily on the analysis of 
sequences derived from proteins or genes known to be 
specific to a particular genus, i.e., ribosomal RNA or 
outer membrane proteins. Applicants 1 method departs 

25 from the art in that the conserved sequences of the 

instant method are not derived from a known gene, nor is 
the sequence associated with any known phenotypic 
characteristic. Further, Applicants 1 method is refined 
by the selection of the most conserved region of the 

30 diagnostic fragment by comparison with the genomic DNA 
of a subpopulation of individuals exhibiting the most 
genetic variation within the diagnostic fragment. 
Applicants 1 method presupposes that the regions of the 
diagnostic fragment most conserved within the 

35 polymorphic subpopulation will also be conserved within 
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the larger population comprising all members of the 
genus. Applicants are unaware of any art teaching this 
supposition. 

The process of the present invention has been 
5. enabled in the present disclosure by the elucidation of 
a diagnostic marker sequence which is useful in rapidly 
and definitively identifying bacteria from the genus 
Salmonella . 

SSMM&&X OF THE INVENTION 

10 The present invention provides a method for the 

determination of diagnostic genetic markers for the 
identification of individuals of a genetically related 
population of microorganisms. The method comprises the 
following steps: 

15 (i) The first step entails performing a RAPD 

amplification on the genomic DNA of a representative 
number of individuals from a genetically related 
population, wherein said number of individuals will 
comprise the positive test panel, and whereby the RAPD 

20 amplification performed on individuals of the positive 

test panel will generate a RAPD marker profile from each 
individual of the positive test panel. Similarly the 
same RAPD amplification. is performed on the genomic DNA 
of a significant number of individuals genetically 

25 unrelated to the positive test panel, wherein said 
number of genetically unrelated individuals will 
comprise the negative test panel, and whereby the RAPD 
amplification on individuals of the negative test panel 
will generate a RAPD marker profile from each individual 

30 of the negative test panel. 

(ii) The second step comprises comparing the 
RAPD marker profiles from individuals of the positive 
test panel with the RAPD marker profiles from 
individuals of the negative test panel and thereby 

35 selecting a diagnostic nucleic acid fragment wherein 
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said fragment is present in over 90% of the RAPD marker 
profiles from the positive test panel and absent in the 
RAPD marker profiles from the negative test panel. 

(iii) The nucleotide sequence of said 

5 diagnostic fragment is determined to identify available 
primer binding sites. 

(iv) One or more pairs of primers 
corresponding to the available primer binding sites of 
step (iii) are prepared. 

10 (v) Primer-directed amplification is 

performed on the genomic DNA of a significant number of 
individuals from the positive test panel using the 
primer pairs of step (iv), whereby a subpopulation of 
individuals which are the most polymorphic with respect 

15 to said diagnostic fragment is identified. 

(vi) Primer-directed amplif ication is next 
performed on the genomic DNA of said polymorphic 
subpopulation of (v) using several candidate primer 
pairs derived from the sequence of said diagnostic 

20 fragment , whereby a particular candidate primer pair 
which produces primer amplification product for the 
highest percentage of individuals within the polymorphic 
subpopulation is thereby empirically selected. This 
primer pair now defines the diagnostic marker for that 

25 genetically related population of step (i) . 

(vii) The method further comprises the step of 
confirming that the particular primer pair identified in 
(vi) is useful for amplifying a diagnostic genetic 
marker which is present in all of the genetically 

30 related individuals while absent in all of the 
genetically unrelated individuals, wherein said 
confirmation is accomplished by amplifying the genomic 
DNA of all individuals of the positive and negative test 
panels with said particular primer pair to determine 

35 that said primer pair is effective in amplifying a 
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diagnostic genetic marker in all individuals of the 
positive test panel and is ineffective in amplifying 
said diagnostic marker in all individuals of the 
negative test panel. 
5 This invention further provides a method of 

determining whether an unknown bacterium is a member of 
the genus Salmonella , comprising analyzing the genomic 
DNA of said unknown bacterium to detect the presence of 
nucleic acid Sequence ID No. 1 or its complement, 

10 No. 20. In a preferred embodiment, said analysis can be 
accomplished by amplification using the primer pairs of 
Sequence ID Nos . 15 and 19. 

This invention further provides isolated nucleic 
acid fragments having Sequence ID Nos. 1, 4, 14, 15 f 16, 

15 17, 18, 19, 10, 21 and 22. 

RRTFF DESCRIPTION OF THE FIGURES 

Figure 1A is a composite photograph showing 
electrophoretic marker profiles of amplification 
products for the positive test panel of Salmonella 

20 strains amplified with a single RAPD primer, 12CN03 
(Sequence ID No. 4) . 

Figure IB is a composite photograph showing 
electrophoretic marker profiles of amplification 
products for DNA from the negative test panel comprising 

25 a variety of non-Saljnonella bacterial strains amplified 
with a single RAPD primer, 12CN03 _ (Sequence ID No. 4), 
Figure 2 is the sequence of an 811 bp Salmonella 
diagnostic nucleic acid fragment, Sequence ID No. 1, 
which was generated by amplification of genomic DNA 

30 isolated from Salmonella typhimurlum (ATCC 29057) with 
the single 12-base primer 12CN03. The complementary 
strand to Sequence ID No. 1 is Sequence ID No. 20. 
Within this 811 bp nucleic acid of Figure 2, at position 
No. 35 to 786, is Sequence ID No. 21 and its complement, 
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Sequence ID No. 22, which comprise the diagnostic marker 
of the invention for Salmonella. 

Figure 3 is a composite photograph showing normal 
(N) and polymorphic (P) electrophoretic PCR 
5 amplification products generated from the primers 
54-23/665rc-23 (Sequence ID Nos. 10/13) and primers 
126-23/ 648rc-23 (Sequence ID Nos. 11/12) from a variety 
of Salmonella strains. 

Figure 4 is a composite photograph showing PCR 
10 amplification of a variety of non-Salmonella strains 
using primer #60-26 (Sequence ID No. 15) and primer 
#761rc-26 (Sequence ID No. 19). 

Figure 5 is a composite photograph showing PCR 
amplification of a variety of Salmonella strains using 
15 primer #60-26 (Sequence ID No. 15) and primer #761rc-26 
(Sequence ID No. 19) . 

pflTATT.F.n DFSP PTPTTON OF THK INVENTION 

As used herein the following terms may be. used for 
interpretation of the claims and specification. 

20 "Nucleic acid" refers to a molecule which can be 

single stranded or double stranded, comprising monomers 
(nucleotides) containing a sugar, phosphate and either a 
purine or. pyrimidine. In bacteria, lower eukaryotes, 
and in higher animals and plants, "deoxyribonucleic 

25 acid" (DNA) refers to the genetic material while 

"ribonucleic acid" (RNA) is involved in the translation 
of the information from DNA into proteins . 

The term "primer-directed amplification" refers to 
any of a number of methods known in the art that result 

30 in logarithmic amplification of nucleic acid molecules 
using the recognition of a specific nucleic acid 
sequence or sequences to initiate an amplification 
process. Applicants contemplate that amplification may 
be accomplished by any of several schemes known in this 

35 art, including but not limited to the polymerase chain 
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reaction (PCR) or ligase chain reaction (LCR) . If PCR 
methodology is selected, the amplification method would 
include a replication composition consisting of for 
example, nucleotide triphosphates, two primers with 
5 appropriate sequences, DNA or RNA polymerase and 
proteins. These reagents and details describing 
procedures for their use in amplifying nucleic acids are 
provided in U.S. Patent 4,683,202 (1987, Mullis, et al.) 
and U.S. Patent 4,683,195 (1986, Mullis, et al.). 
10 - A "diagnostic fragment" refers to a particular DNA ^ 

sequence which is highly conserved amongst the 
individuals of a particular genetically related 
population, for example, a genus, species, or subspecies 
of bacteria. In the instant invention, the term 
15 "diagnostic fragment" is used to refer to that fragment 
generated during RAPD amplification which is present in 
the RAPD profiles from a particular related group but 
absent in profiles from individuals outside of that 
group. The term "diagnostic marker" is used herein to 
20 refer to that portion of the diagnostic fragment which 
can be targeted to produce an amplification product in 
only members of the related group. The diagnostic 
marker is not present outside the related group, and 
attempts to amplify the diagnostic markers in 
25 individuals outside of the related group will result in 
no -nucleic acid being amplified. 

The term "primer" refers to a nucleic acid fragment 
or sequence that is complementary to at least one 
section along a strand of the sample nucleic acid, 
30 wherein the purpose of the primer is to sponsor and 
direct nucleic acid replication of a portion of the 
sample nucleic acid along that string. Primers can be 
designed to be complementary to specific segments of a 
targeted sequence. In PCR, for example, each primer is 
35 used in combination with another primer forming a. 
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"primer set" or "primer pair", this pair flanks the 
targeted sequence to be amplified. In RAPD 
amplification, single arbitrary primers are used to 
amplify nontargeted segments of nucleic acid which are 
5 located between the primer sequence sites in opposing 
DNA strands. The term "primer", as such, is used 
generally herein by Applicants to encompass any 
sequence-binding oligonucleotide which functions to 
initiate the nucleic acid replication process. 

10 "Diagnostic primers" will refer to primers designed with 
sequences complementary to primer binding sites on 
diagnostic marker. Diagnostic primers are useful in the 
convenient detection and identification of individuals 
of a genetically related population. 

15 A genetically related population refers to any 

grouping of microorganisms possessing multiple or single 
phenotypic characteristics of sufficient similarity to 
allow said organisms to be classified as a single genus, 
species, or subspecies of bacteria. For purposes of the 

20 present disclosure, examples of genetically related 

populations include, for example, the genus Salmonella 
or the species Listeria monocytogenus . 

A "test panel" refers to a particular group of 
organisms or individuals selected on the basis of their 

25 genetic similarity to each other, or their genetic 
dissimilarity to another group (i.e., another genus, 
species, subspecies) . A "positive test panel" will 
refer to a number of individuals selected for the 
desired genetic similarity between those individuals, 

30 and in the instant case will be comprised of individuals 
included within the desired genetically related 
population. Examples of a positive test panel would be, 
for example, representative members of all the species 
of a particular genus (assuming that genus is the 

35 desired 'genetically related population').. Similarly, a. 
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"negative test panel" will refer to a test panel 
selected on the basis of genetic diversity between its. 
members and the members of the positive test panel. An 
example of a negative test panel when the positive test 
5 panel is bacteria of the genus Salmonella, would be 

bacteria and other organisms outside of the Salmonella 
genus . 

The term "representative number of individuals" 
refers to individuals within a genetically related 

10 population which are selected such that they represent 
the widest possible range of biochemical/ morphological 
and immunological attributes known to exist within the 
targeted genetically related population. The term 
"representative number of individuals", when referring 

15 to individuals genetically unrelated to the genetically 
related population (the negative test panel) , means 
those microorganisms which are not included within the 
genetically related group but are genetically similar to 
that group • 

20 The term "unknown bacterium" refers to a bacterium 

whose identity is unknown. 

The term "derived from", with reference to an 
amplification primer, refers to the fact that the 
sequence of the primer is a fragment of the sequence 

25 from which it has been "derived". The fragment is 

always denoted in a 5' to 3' orientation. The useful 
primer sequence size range for PCR amplification is 
about 15 base pairs to about 30 base pairs in length. 
The term "RAPD" refers to 'random amplified 

30 polymorphic DNA' . "RAPD amplification" refers to a 
method of single primer directed amplification of 
nucleic acids using short primers of arbitrary sequence 
to amplify nontargeted, random segments of nucleic acid. 
U.S. 5,126,239. "RAPD method" or "RAPD analysis" refers 

35 to a method for the detection of. genetic polymorphisms 
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involving the nontargeted amplification of nucleic acids 
using short primers of arbitrary sequence , whereby the 
profile or pattern of 'RAPD 9 amplification products is 
compared between samples to detect polymorphisms. "RAPD 
5 primers" refers to primers of about 8 to 13 bp, of 

arbitrary sequence, useful in the RAPD amplification or 
RAPD analysis according to the instant method. The 
"RAPD marker profile" refers to the pattern, or 
fingerprint, of amplified DNA fragments which are 

10 amplified during the RAPD method and separated and 
visualized by gel electrophoresis. 

The diagnostic marker of the invention, once 
identified, can be used to identify an unknown 
microorganism by any of several analysis methods. In 

15 the present invention, primers flanking the marker are 
useful to amplify the marker using PCR. Alternatively, 
nucleic acid probes could be developed based upon some 
or all of the diagnostic marker sequences and thus used 
to detect the presence of the marker sequence using 

20 standard hybridization and reporter methods. It is 

contemplated that regions of about 30 base pairs or more 
of the diagnostic marker, especially encompassing the 
primer regions could be used as sites for hybridization 
of diagnostic probes . 

25 The present invention provides a method for the 

determination of genetic markers useful in the detection 
and identification of all members of a genetically 
related population. Examples of genetically related 
populations include following: 

30 1) microorganisms belonging to the genus 

Salmonella 

2) microorganisms belonging to the species 
Listeria monocytogenes 

3) microorganisms belonging to the serotype of 
35 Escherichia coll designated 0157:H7. 



WO 95/33854 



PCT/US95/06704 



16 

The instant method is particularly useful fox the 
detection of specific genera , species or subspecies of- 
bacteria which may be present either in food, human or 
animal body fluids or tissues, environmental media or 
5 medical products and apparatti. 

To practice the instant method, a RAPD 
amplification, using a short arbitrary primer, is 
performed on the genomic DNA of at least 30 individuals 
from a genetically related population. These 

10 individuals are selected such that they represent the 

widest possible range of biochemical, morphological and 
immunological attributes known to exist within the 
targeted genetically related population* The electro- 
phoretically resolved patterns of amplification products 

15 produced by the RAPD amplifications are then compared, 
in hopes of indentifying a distinctive RAPD 
amplification product which is present in over 90% of 
the individuals tested. If this product is not found 
when the same RAPD amplification is then performed on 

20 the genomic DNA of at least 30 strains of microorganisms 
which fall outside of the targeted population, then this 
fragment is deemed to be suitable diagnostic fragment 
and it is then sequenced to determine suitable primer 
binding sites for further analysis and primer 

25 generation. It is imperative that the most conserved 

regions of the diagnostic fragment be determined for the 
generation of useful diagnostic primers, i .e . , primers 
which will be capable of producing an amplification 
product in all members of the genetically related group. 

30 Determination of the most conserved region is 

accomplished by first determining which individuals, in 
the population group to be detected, exhibit the most 
genetic variation within the diagnostic fragment 
sequence. The genomic DNA of this polymorphic 

35 subpopulation is then analyzed with several sets of PCR 
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primers generated from the diagnostic fragment ta define 
the most highly conserved PCR primer bindings sites 
within the diagnostic fragment. Primers generated from 
these highly conserved primer binding sites are then 
5 used in assay methods for the detection of all members 
of the genus. The method is more particularly described 
below with reference to the specific method steps as 
provided in the Summary of the Invention. 
Selection of RAPD primpr.q and detec t- i on of diagnostic 
10 fragment in memb ers of the positive and negative control 

panels f steps ( i ) and (ii) : 

Genomic DNA isolated from positive and negative 
test panels of microorganisms was subjected to RAPD 
amplification using eight 12-base primers of arbitrary 
15 sequence. The positive test panel comprised 62 

Salmonella serotypes and is described in detail in the 
GENERAL METHODS section below. The negative test panel 
consisted of a variety of 11 non- Salmonella species and 
is also described in the GENERAL METHODS section below. 

20 Techniques for the isolation of genomic DNA are common 
and well known in the art and examples may be found in 
Sambrook et al., Molecular Cloning: A Laboratory Manual 
- volumes 1,2,3 (Cold Spring Harbor Laboratory: Cold 
Spring Harbor/ New York. 

25 RAPD primers of 12 bases in length were used 

because at this primer length the RAPD patterns 
generally contained one to five amplified DNA fragments. 
Use of shorter primers frequently resulted in a large 
number of amplification products, which made the 

30 extraction of a single homogeneous fragment for 

sequencing much more difficult. When primers of greater 
than 12 bases were used a significant fraction of the 
bacterial strains produced no RAPD products which would 
have necessitated the screening of a much larger number 

35 of arbitrary primers. One of the primers, designated 
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12CN03, was found to produce both an 800 bp and 2000 bp 
amplification product in over 90% of the positive test 
panel. 12CN03 had the sequence of TTA GTC ACG GCA 
(Sequence ID No. 4) . Neither the 800 bp nor 2000 bp 
5 fragment was seen in the amplification products of the 
negative test panel with primer 12CN03 . Because of its 
shorter length it was decided to focus attention on the 
800 bp fragment for further analysis and this became the 
Salmonella diagnostic fragment. Figure 2 shows this 
10 fragment, wherein the top strand is shown as Sequence ID 
No. 1, and its complementary strand is shown as Sequence 
ID No. 20. 

The 800 bp fragment did not appear with equal 
intensity in all of the Salmonella strains in the 

15 positive test panel. Considering the extreme 

sensitivity of RAPD patterns to sequence polymorphisms, 
it is assumed that the variations in the intensity of 
the RAPD marker in some Salmonella strains was the 
result of a minor sequence variation in the vicinity of 

20 the primer site. Considering the frequency at which the 
Salmonella fragment appeared/ it could still be possible 
for highly conserved sequences, which are common to all 
members of the genus Salmonella, to be found between the 
12CN03 priming sites flanking the 800 bp fragment. 

25 Sequencing nf diagnostic fragment, ste p (iii) : 

The 800 bp product for Salmonella typhlmurium 587 
(ATCC #29057) was selected for extraction and 
sequencing. This strain was selected because it is a 
well-characterized type strain and because this serotype 

30 of Salmonella is a frequently encountered pathogenic 

microorganism. The amplification product was isolated 
by gel electrophoresis and the fragment was cut from the 
gel, eluted and reamplified with the 12CN03 primer to 
provide quantities of DNA suitable for sequencing. 

35 Sequencing was accomplished using the chain-termination 
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method of Sanger et al. (Proc. Natl. Acad. Sci. USA 74, 
5463, (1977)) using fluorescence-labeled 
dideoxynucleotides and the Genesis 2000™ DMA -Analysis 
System (E . I. du Pont de Nemours and Company, 
5 Wilmington, DE) - The complete sequence of the 800 bp 
Salmonella diagnostic fragment is shown in Figure 2. 
Trfpnfu^ a tjnn ~* moat hiahiv connffrvftri regions of 
the dlagj a aatJLs fragment, stppa fiv) nnd (v) : 

In order for the Salmonella diagnostic fragment to 

10 be useful for the detection of all members of the 

Salmonella genus, it is necessary to identify the most 
conserved regions (i.e., primer sites) of the diagnostic 
fragment. In theory, identification of the conserved 
regions could be accomplished by generating primers to 

15 the fragment based on the known sequence and isolating 

and sequencing the same fragment from all members of the 
Salmonella genus. Sequencing, alignment and comparison 
of all the sequences would allow for the determination 
of the most conserved portion of the sequence. 

20 Although, this approach is theoretically possible, in 
reality it is prohibitively time consuming and 
expensive . The development of a general method requires 
an alternate approach. 

The instant method provides a more direct and rapid 

25 method of identifying the most conserved regions of the 
diagnostic fragment, wherein the first step is the 
identification of a subpopulation of Salmonella sp. 
which show the greatest overall variation within the 
800 bp diagnostic fragment. The strains which 

30 constitute this subpopulation are referred to as 

"polymorphic" Salmonella. It must be understood that 
this subpopulation is defined as polymorphic only in the 
context of the diagnostic nucleic acid fragment shown in 
Figure 2 and not with respect to the classical 

35 biochemical and morphological attributes, -.which are 
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commonly used to classify species. Once the members of 
the positive test panel that are polymorphic have been 
identified, these polymorphic Salmonella are used to 
screen for the most highly conserved regions of the 
5 diagnostic fragment. This approach presumed that the 
priming sites that are conserved among the most 
polymorphic Salmonella are also conserved in the general 
population of Salmonella. 

In order to determine which Salmonella were 

10 "polymorphic" two sets of amplification primer pairs 

were arbitrarily selected from the diagnostic fragment 
and amplifications were carried out on DNA isolated from 
740 strains of Salmonella representing all six subgenus 
groups for each of the primer sets. The initial primer 

15 sets were selected to achieve a GC content of 55 ± 3% 

for two pairs of primers all of which are located within 
200 bases of the CN03 priming sites. Any strain of 
Salmonella which showed an amplification polymorphism 
was classified as a "polymorphic" Salmonella. The 

20 following amplification events were regarded as 

polymorphic when they occurred with either primer set: 

i) weak, inconsistent, or total lack of 
production of an amplification product 

ii) amplification products which are larger 

25 or smaller than the generally observed amplification 
product 

iii) the presence of more than one 
amplification product. 

The largest single polymorphic group among the 740 
30 strains of Salmonella were those which produced no 

amplification product with at least one of the primer 
pairs. However, a number of strains produced either 
multiple amplification products or products of a 
different size. Some examples of these types of 
35 polymorphic amplification events are shown in Figure 3. 
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From the original group of 740 Salmonella strains a 
group of 43 polymorphic Salmonella were selected. 
Selection of a riiacmoslHff nrlmer pair to amplify the 

diagnostic aen fttic markpr. fitep (vi) : 

5 Once the subpopulation of "polymorphic" Salmonella 

was identified primers were prepared for a large number 
sites at both ends of the Salmonella fragment sequence. 
The initial criteria for primer selection was that the 
GC content of the two primers should match and that the 
10 overall GC content fell in the range of 55 ± 3%. The 
second criteria was that the pairs of primers were all 
located within 200 bases of the CN03 priming sites. 
Using these primers amplifications were carried out on 
genomic DNA from the polymorphic Salmonella. Primer 
15 combinations which produced an amplification product in 
over 90% of the polymorphic Salmonella were selected for 
further evaluation. In such combinations/ one of the 
primer sites was "locked" while the second priming site 
was moved upstream or downstream one base at a time. In 
20 this way the priming site that found the highest portion 
of polymorphic Salmonella was identified and fixed. The 
second priming site was then "locked" and additional 
primers were prepared, which moved the first priming 
site at the other end of the Salmonella target sequence 
25 upstream or downstream one base at a time. When the 
priming sites which produced an amplification product 
for the highest percentage of polymorphic Salmonella 
were identified, these primers were then evaluated for 
the entire test panel of Salmonella strains. Based on 
30 this analysis four regions were identified as being most 
conserved. Within these conserved regions five primer- 
pair combinations were capable of producing an 
amplification product in £ 95% of the polymorphic 
Salmonella. These primer combinations were selected for 
35 further testing. 
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Confirmation nf s plftrted primer pair- as a diagnostic 
genetic marker, stffl (viii) : 

The selected priming sites were understood to be 
highly conserved among the "polymorphic" Salmonella. 
5 The initial step in the final screening procedure was 
the determination of which, if any, priming sequences 
were conserved outside the genus Salmonella, The 
selectivity of the Salmonella primer sets was evaluated 
using a negative test panel consisting of over 100 

10 strains representing 28 species which were either 

similar phenotypically to Salmonella or likely to be 
found in similar environments. The primer combination 
which showed the lowest rate of false positive responses 
in the negative test panel was then evaluated to 

15 determine its inclusivity for a positive test panel 
consisting of over 14 80 Salmonella strains. 

EXAMPLES 

ffF.WFRAL METHODS 

Suitable methods of genetic engineering employed 
20 herein are described in Sambrook et al., Molecular 
Cloning: A Laboratory Manual - volumes 1,2,3 (Cold 
Spring Harbor Laboratory: Cold Spring Harbor, New York, 
1989), and in the instructions accompanying commercially 
available kits for genetic engineering. GeneClean 
25 (BiolOl LaJolla, CA) was used to isolate nucleic acid 
fragments from agarose gels and to remove enzymes from 
restriction digests and was performed as specified by 
the manufacturer. Unless otherwise specified all other 
standard reagents and solutions used in the following 
30 examples were supplied by J. T. Baker Co. (Phillipsburg, 
NJ) . 

• Construction of Positive and Negative Te st Panels 

For the identification of a genus level Salmonella 
RAPD marker a positive test panel consisting of a 
35 variety of Salmonella subgenera was constructed to 
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TaZT T ^ W ° Uld ±nClUde a broad -nge of 

strains . The posit±ve test contained 

of thefollowxng Salmonella serotypes: Subgenus I; 

5. t^phimurium, 5 . tOTM/ s> enterltldls, S. saintpaul, 

S. blnza, S . napollf s> cJerArhreJJf s< ijjfantiS/ . 

S. newport, s. heidelberg, s. virchow, S. Stanley, 

S. senftenberg, s. gallinarivm, s. cholerasvis, 

S. paratyphi, S . bredeney, s. kedougou, S. montevideo, 

S hadar, s. panama, s. braenderup, s. blockley, 

S agona, s. brandenberg, s. anatum, s. thompson, 

*. *>erta, s. manchester, s. ealing, s. eastbourne, 

S. Indiana, s. weltevreden, S. bracknell, 

S IZTr* 1 * 10 ™' S ' bareilly ' S - briSt ° 2 > *• ^gen, 
15 a b TL S ' birk±nhead ' S - "ti-- S. amager, 

S. bluk„a, s. bonn, S. brazil, s. bvtantan, 

S. bodjonegro, S. adelaide, S. allandale, 

S. albuguergue, S. aeguatoria, abaetetube, S. alabana, 

S. alachua, and S. Chicago; Subgenus II; s. artis 

20 I' lTTT iR ' s ' bulawayo ' s - bleadon > s - *^°*y> 

S basel; Subgenus Ilia; s. arizonae; Subgenus V; 
S. brookfield. 

The negative test panel in the screening for a -RAPD 
SP6CifiC t0 «. consisted of the follow^ 

spec.es; Escherichia coli, Escherichia blattae, 

, " ' Shi ** lla ^nei, Shigella flexneri, Shigella 

Citrobacter freundii. These species represent a 
sampling of strains which are not included within the 
0 genus Sai.one.ia but are genetically similar to 

Sa 2nionella . if strains representing these species show 

a substantially different RAPn „»«.*.. u 

* ilierent RAPD pattern when amplified 
with the arbitrary p r i mer used to generate the 
Salmonella marker, and if the selected Salmonella marker 
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is absent from the pattern, it is expected that the 
marker sequence will be selective for Salmonella. 

EXAMPLE 1 

ISOLATION OP DIAGNOSTIC FRA GMENT FROM SALMONELLA SP , 
5 RAPD Screen Test Results: 

Genomic DNA was isolated from members of both the 
positive and negative test panel members (above) and 
used to screen eight, 12-base primers of arbitrary 
sequence . These primers were used to generate RAPD 
10 patterns for strains representing the positive and 

negative test panels. The primers used in the initial 
RAPD screening are listed in Table I . 

TABLE I 

Tw fi iv ft -R aR P Arbitrary Primers Used in the 
Generation of RAPD Pat-terns for the Purpose of 
Identifying a Specific Gen us Level Salmonella Marker 



12CN01 




AGC 


TGA 


TGC 


TAC 


(Sequence 


ID 


No. 


2) 


12CN02 




AGT 


CGA 


ACT 


GTC 


(Sequence 


ID 


NO. 


3) 


12CN03 




TTA 


GTC 


ACG 


GCA 


(Sequence 


ID 


No. 


4) 


12CN04 




TGC 


GAT 


ACC 


GTA 


(Sequence 


ID 


No. 


5) 


12CN05 




CTA 


CAG 


CTG 


ATG 


(Sequence 


ID 


No. 


6) 


12CN06 




GTC 


AGT 


CGA 


ACT 


(Sequence 


ID 


No. 


7) 


12CN07 




GGC 


ATT 


AGT 


CAC 


(Sequence 


ID 


No. 


8) 


12CN08 




CGT 


ATG 


CGA 


TAC 


(Sequence 


ID 


No. 


9) 



The primers were used individually and as mixed 
pairs in the following amplification protocol; 

15 For each 50 \il reaction, 2 fll - dNTP mix (5 mM dNTP 

each) , 35 111 deionized water, 5 \ll - 10X reaction buffer 
(500 mM KC1, 100 mM tris @ pH 8.3, 15 mM MgCl 2 , 0.003% 
gelatin), 2.5 fll - each primer (10 mM) (5 Jll if only one 
primer is used), 0.4 [ll Taq polymerase (5 U/Jll) , and 

20 1.2 ^11 Taq dilution buffer (10 mM tris 0 pH 8.0 and 1.0% 
Tween 20) were combined. 1.0 |ll - genomic bacterial DNA 
@ 50 ng/p.1 was added. The reaction was heated to 94°C 
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for 5 minutes. 32 cycles of the following temperature 
cycle were run; 1' 0 94°, 5' @ 46°, 2 9 ramp to 72°C, and 
2' <§ 72°C. A 5 |ll aliquot of the reaction was combined 
with 2 jxl of Ficol-loading buffer and run on a 4% 
5 acrylamide gel (29:1) /l. Ox TBE. 

Figure 1A shows the RAPD patterns as separated by 
gel electrophoresis for samples of 16 different species 
of Salmonella from the positive test panel which was 
amplified with a single primer, 12CN03 . The lanes are 
10 correlated with the Salmonella species as. follows: 



Lane 
1 


Species and I.D. No. 


Lane 




Species and I.D 


S. typhlmurlum 587 
<ATCC 29057) 


9 


S. 


infant Is 728 


2 


S. typhlmurlum 588 
(ATCC 29631) 


10 


S. 


heidelberg 577 


3 


5. binza 1085 


11 


S. 


virchour 738 


4 


5. napoli 966 


12 


S. 


Stanley 739 


5 


5. enterltldls 1109 


13 


S. 


senftenberg 740 


6 


5. enteritidis 737 


14 


5. 


gallinarlum 741 


7 


S. newport 707 
(ATCC 6962) 


15 


S. cholerasuis 917 
(ATCC 13312) 


e 


5. arlzonae 725 
<ATCC 13314) 


16 


S- paratyphi A 918 
(ATCC 9150) 



Standard amplification conditions for amplification of 
DNA from the positive test panel consisted of 0,2 mM 
dNTPs, 1 JIM 12CN03 primer and a reaction buffer of 50 mM 
KC1, 10 mM tris @ pH 8.3, 1.5 mM MgCl2, and 0.0003% 

15 gelatin, A total of 32 cycles were run under the 

following conditions: 1' at 94°C; 5 f at 46°C; 2' ramp 
to 72°C and 2 f at 72°C. The final cycle was followed by 
an additional 9 1 at 72°C Unlabeled lanes contain 
molecular weight markers of the following sizes; 228 , 

20 412, 693, 1331, and 2306 base pairs (bp) . RAPD 
amplification products were elect rophoresed in 4% 
acrylamide/bisacrylamide (29/1) using a 1.0 X tris- 
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borate-EDTA running buffer for 55 minutes at a field 
strength of 14V/cm. Following electrophoresis the gels 
were stained for 15 minutes in a solution of ethidium 
bromide at 0.25 ^ig/ml. 

As is evident by Figure 1A the positive test panel 
produced two characteristic amplification products of 
800 and 2000 bp, which appeared in over 90% of the 91 
Salmonella strains tested. 

Figure IB shows the RAPD patterns as separated by 
gel electrophoresis for samples of 13 different species 
of a variety of Salmonella bacteria from the negative 
test panel which were amplified with a single primer, 
12CN03. The lanes are correlated with the bacterial 
species as follows : 



Lane Sponiea and I.D. No. 

1 Shigella sonnei 702 

2 Shigella flexneri 1083 
(ATCC 29903) 

3 Shigella dysenteria 1082 
(ATCC 13313) 

4 Shigella boydii 1081 
<ATCC 8700) 

5 Citrobacter diveasus 97 



Citrobacter freundii 383 
(ATCC 8700) 



Lanft Spgniea and I. D. No. 

9 Escherichia coli 90 

10 Escherichia blattae 84 6 
(ATCC 29907) 

11 Escherichia fregusonii 847 
(ATCC 35469) 

12 Escherichia hermani 848 
(ATCC 33650) 

13 Escherichia vulneris 850 
(ATCC 33821) 



15 Standard amplification conditions for the 

amplification of the negative test panel consisted of 
0.2 mM dNTPs, 1 HM 12CN03 primer and a reaction buffer 
of 50 mM KC1, 10 mM tris @ pH 8.3, 1.5 mM MgCl2, and 
0.0003% gelatin. A total of 32 cycles were run under 

20 the following conditions: 1' at 94°C; 5' at 46°C; 2 1 
ramp to 72°C and 2' at 72°C. The final cycle was 
followed by an additional 9' at 72°C. Molecular weight 
markers, gel composition, electrophoresis and staining 
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conditions were as described above for the positive test 
panel. 

As is evident by the data in figure IB, none of the 
negative test panel group showed the 800 bp or 2000 bp 
5 amplification products seen in the positive test panel. 
Sxt-.racrh.lon and SftCfU P nnina of the Salmonella d i agnost i c 

Fragment ; 

The 800 bp product for Salmonella typhlmurium 587 
(ATCC #29057) was selected for extraction and 

10 sequencing. The amplification' product was isolated by 
electrophoresis in a low melting point agarose. The 
fragment was cut from the gel and extracted onto 
GlassMilk™ using a customized procedure from the 
Geneclean kit sold by Bio 101 Inc. The fragment was 

15 then eluted and reamplified with the 12CN03 primer to 
provide quantities of DNA suitable for sequencing. 

Since both ends of the fragment contain the same 12 
base sequence, priming the parent diagnostic fragment 
with the 12CN03 primer would result in the production of 

20 two simultaneous sequences superimposed upon each other, 
which could not be resolved into the individual single- 
stranded sequences. Hence, it was necessary to carry 
out a restriction endonuclease digestion of the 
amplified 12CN03 product prior to running the sequencing 

25 reaction. Digest products were separated 

electrophoretically in low melting agarose and the 
appropriate restriction product was reisolated using the 
Geneclean procedure. The individual purified 
restriction digest products were then sequenced using 

30 12CN03 as a sequencing primer. The restriction fragments 
were sequenced by the Sanger chain— termination method 
using fluorescence-labeled dideoxynucleotides and the 
Genesis 2000™ DNA Analysis System. 

An example of the sequencing protocol used is as 

35 follows: 
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Combine 1.5 JXl - purified digest product (est. 
100 ng), 3.5 \ll - 12CN03 G 10.0 ng/^ll and 28.5 (Ll - H 2 D 
and heat to 95°C for 2 minutes . Immediately place the 
mixture on wet ice. Add the following mixture 10 \ll - 
5 5X reverse transcriptase reaction buffer (300 mM tris @ 
pH 8.3, 375 mM NaCl, 37.5 mM MgCl 2 ) , 6.5 fll - dNTP stock 
(180 uM ea.), 0.65 Jil - ddNTP stock (250 JIM 505nm-ddGTP, 
800 \HA 512nm-ddATP, 210 JIM 519nm-ddCTP and 700 \M 
526nm-ddTTP) and 1 Jil - reverse transcriptase. Vortex, 
10 centrifuge and then incubate at 46°C for 15 minutes. 
Separate the sequencing products on a spin column and 
vacuum dry. Wash with 150 Jil of cold 70% ethanol and 
centrifuge 5 min . Vacuum dry and reconstitute in 3 pi 
f ormamide . 

15 The labeled sequencing products were then analyzed 

by the Genesis 2000™ DNA Analysis System. Once 
differential sequence had been determined at both ends 
of the Salmonella target fragment the remaining sequence 
information was obtained through the use of either 

20 asymmetric PCR to generate single-stranded DNA or a 

modified double-stranded DNA sequencing protocol using 
double-stranded PCR product. The modification in the 
double-stranded protocol consisted of using a 4 6°C 
annealing temperature and a primer : template ratio of 

25 25:1. This ratio is significantly higher than is 

generally practiced in sequencing reactions. At such a 
large primer : template ratio, priming at multiple sites 
is generally observed with single-stranded templates. 
However, when the template consists of short linear 

30 double-stranded DNA, successful priming can only occur 
at 5' blunt ends of the template and only with a primer 
whose sequence matches that end. The net result is that 
only a single discrete sequencing product is observed 
under these conditions. The sequence of the complete 

35 Salmonella fragment is shown in Figure 2. 
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EXAMPLE 2 

nF.Ty.RMTNA TTOW OF POLYMORPHIC 
POSTTTVE TES T E&HEL STRAINS 

The following procedure was used to determine which 
strains of Salmonella were most "polymorphic" with 
respect to the sequence of the diagnostic fragment shown 
in Figure 2. Two sets of amplification primer pairs 
were arbitrarily selected from the marker sequence. The 
sequence of these primers is shown in Table II. 

TABLE II 

Primers nflori in fcfrfi dete n t Pflt -io" Bf T>olVfflOrT>ll 1 C Sfl i fflflnel i fl 

#54-23 GAC GCT TAA TGC GGT TAA CGC CA (Sequence ID No. 10) 

#126-23 AAC CAT GCA TCA TCG GCA GAA CG (Sequence ID No. 11) 

#648rc-23 AGT AGC CTG CCG CTT ACG CTG AA (Sequence ID No. 12) 

#665rc-23 TCA GGA TGC AGG CGA TAG TAG CC (Sequence ID No. 13) 



10 Primer nomenclature: 

The first number indicates the 3 f position of the 
primer on the Salmonella target sequence in Figure 2. 
The rc indicates that the primer sequence is derived 
from the reverse complementary strand. The 23 indicates 

15 the length of the primer. 

Amplifications were carried out on DNA isolated 
from 740 strains of Salmonella representing all six 
subgenus groups for each of the primer sets, 
54-23/665rc-23 and 126-23/648rc/23 . Standard 

20 amplification conditions consisted of 0.2 mM dNTPs, 

0.5 (iM each primer and a reaction buffer of 50 mM KC1, 
10 mM tris 0 pH 8.3, 1.5 mM MgCl2, and 0.0003% gelatin. 
A total of 35 cycles were run under the following 
conditions: 15 seconds at 94°C; 2 minutes at 69°C and 

2 5 1 minute at 72°C. The final cycle was followed by an 
additional 7 minutes at 72°C. Gel composition, 
electrophoresis and staining conditions were as 
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described above for the positive test panel in 
Example 1 . 

Strains of Salmonella were classified as a 
"polymorphic" if they produced amplification products 
5 that fell into the following categories: 

i) weak, inconsistent, or total lack of 
production of an amplification product; 
ii) amplification products which are larger 
or smaller than the generally observed 
10 amplification product; 

iii) the presence of more than one 
amplification product. 
Examples of these types of polymorphic 
amplification events are shown in Figure 3. Figure 3 
15 shows the amplification product patterns as separated by 
gel electrophoresis for samples of 6 polymorphic and 6 
normal Salmonella amplified with the primers of 
Table II. The lanes are correlated with the Salmonella 
strains as follows: 



Lane 
1 


Spp^ies and I.D__ 


NO... 


Lane 




Species 


and T.D. No. 


S. arizonae 1573 




7 


5. 


Subgenus 


Group XX 1514 


2 


S. arizonae 1572 




8 


S. 


Subgenus 


Group V 1535 


3 


5. typhxmurium 708 
(ATCC 13311) 




9 


5. 


Subgenus 


Group IV 1714 


4 


S. arizonae 726 
(ATCC 12324) 




10 


5. 


Subgenus 


Group V 1773 


5 


S. oranienburg 2212 




11 


5. 


Subgenus 


Group I 1513 


6 


5. Subgenus Group I 


2213 


12 


5. 


Subgenus 


Group X 1517 



20 From the original group of 740 Salmonella strains a 

group of 43 polymorphic Salmonella were selected. 
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EXAMPLE 3 

"F! VAT.TIAT TON OF PRTMTNG SLU ES WTTHTN THE 
DTAGNQSTTP FRAGM ENT FOR TWE BEST 

fip;yrns t,evet, TwrT.u fiTVTTY of xat.monella 
5 Example 3 illustrates the method used to identify 

which priming sites within the diagnostic Salmonella 
fragment showed the best inclusivity for Salmonella at 
the genus level. 

Primers were prepared for a large number sites at 

10 both ends of the Salmonella target sequence. 

Amplifications were carried out on genomic DNA from the 
43 polymorphic Salmonella for a variety of these primer 
combinations according to the protocol listed below. In 
cases where a given primer combination produced an 

15 amplification product in over 90% of the polymorphic 

Salmonella r additional primers were then prepared which 
moved one of the priming sites upstream or downstream 
one base at a time. Once the priming site that found 
the highest portion of polymorphic Salmonella was 

20 identified, that site was fixed and then additional* 
primers were prepared which moved the priming site at 
the other end of the Salmonella target sequence upstream 
or downstream one base at a time? The combination of 
priming sites which produced an amplification product 

25 for the highest percentage of polymorphic Salmonella 
would then be evaluated at the next stage of the 
screening procedure. 

Primer-screening amplification reactions were 
conducted using the following procedure : 

30 Combine 1.5 Jll - dNTP mix (5 mM each dNTP) , 40 fll - 

deionized water, 5 (ll - 10X reaction buffer (500 mM KC1, 
100 mM tris 0 pH 8.3, 15 mM MgCl2, 0.003% gelatin) 
0.4 \il - Taq polymerase (5U/JIL) , 1.2 M-l - Taq dilution 
buffer (10 mM tris 6 pH 8.0 and 1.0% Tween 20), 0.66 p.1 

35 - each primer (26-mer 0 10 flM), and 1 . 0 Jll - genomic DNA 
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@ 50 ng/|il. Heat to 94°C for 2 minutes. Run 35 cycles 
of 15"@ 94°C; 3' @ 72°C. Combine a 5 p.1 aliquot of the 
reaction with 2 111 of Ficol-loading buffer and run on a 
4% acrylamide gel (29:1) /I. OX TBE . 
5 Sample responses were graded as follows : 

If a PCR product was visible at < 5 x 10 4 DNA 
copies per reaction the result was scored as +1. 

If a PCR product was only visible when the DNA copy 
number was > 5 x 10 4 copies per reaction the test was 
10 scored as +0.5. 

The scores for the 43 strains were summed and 
divided by 43. The results of the evaluation were 
assembled in Table III. 
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The numbers on the rows and columns represent the 
3' positions relative to Sequence ID No . 1 of the two 
primers used in the amplification reaction. Based on 
the results of the primer site evaluation, four 
5 locations on the target sequence were sufficiently well 
conserved to yield priming sites capable of capturing 
over 95% of the polymorphic Salmonella. These sites 
were found in the following locations on the target 
sequence as displayed in Figure 2; 59-60, 534-536, 665 

10 and 761. The 761 and 534 sites were selected over the 
665 site because priming sites surrounding the 7 61 and 
534 base positions detected a higher portion of the 
polymorphic Salmonella. Both the 59 and 60 sites were 
evaluated as possible priming sites for the 

15 complementary strand of the target. The sequences for 
these primers are shown in Table IV. 

TABLE IV 

PtHtt^t- .qgnm^nrrgs Fonnri in at T.eaat 35% 
gf the Polymorphic Salmonella 

#59-26 TTA GCC GGG ACG CTT AAT GCG GTT AA Sequence ID No. 14 

#60-26 TAG CCG GGA CGC TTA ATG CGG TTA AC Sequence ID No* 15 

#534rc-26 CTA TTT TCT GGC CTG ACG CTA TGA CC Sequence ID No. 16 

#53 6rc-2 6 TTC TAT TTT CTG GCC TGA CGC TAT GA Sequence ID No. 17 

#665rc-26 CAT TCA GGA TGC AGG CGA TAG TAG CC Sequence ID No. 18 

#7 6lrc-26 CTT TAC CGC TTC CAG TGT GGC CTG AA Sequence ID No. 19 

Primer nomenclature: 

The first number indicates the 3' position of the 
primer on the Salmonella target sequence in Figure 2. 
20 The rc indicates that the primer sequence is derived 

from the reverse complementary strand. The 2 6 indicates 
the length of the primer. 
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positive control to indicate that conditions were 
sufficient for amplification of the Salmonella target 
sequence to take place. The lanes are correlated with 
the non- Salmonella and Salmonella strains as follows: 

& 

Lane spagi&a ffr>H T n - No - Lane fippirlftn anri T . D . Nq . 

1 Escherichia coli 25 7 Enterobacter cloacae 123 

2 Escherichia coli 33 8 Enterobacter cloacae 221 

3 Escherichia coli 57 9 Enterobacter cloacae 313 

4 Escherichia coli 84 10 Enterobacter cloacae 375 

(ATCC 13047) 

5 Escherichia coli 139 11 Proteus mirabilis 360 

6 Salmonella rvphimurium 897 12 Proteus mirabilis 364 

a 

Lane Sporr-ieR anH T D. No. LSIlfi fi pftC i eg ftTlti T.I? . NO ■ 

1 Proteus morganii 99 7 Proteus vulgaris 959 

2 Proteus morganii 363 8 Enterobacter agglomerans 905 

3 Salmonella enter! tidis 1109 9 -Enterobacter aerogenes 62 

4 Pjroteus vulgaris 273 10 Enterobacter aerogenes 37 6 

(ATCC 13048) 

5 Proteus vulgaris 275 11 Klebsiella pneumoniae 373 

(ATCC 13BB3) 

6 Pz-oteus vulgaris 385 12 Klebsiella pneumoniae 749 
(ATCC 13315) 

i, ans spp^^ and t-d. No. iana fipftfilfin and I . P . No . 

1 Listeria monocytogenes 938 7 Citrobacter freundii 896 

2 Listeria monocytogenes 941 8 Citrobacter diversus 217 

3 Listeria innocua 1157 9 Hafnia alvei 934 

4 Listeria ivanovii 1167 10 Serratia marcesens 372 

5 Salmonella infantis 908 11 Enterococcus faecalis 283 

(ATCC 19433) 

6 Citrobacter freundii 361 12 Yersinia enterocolitica 750 
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c. 




Lane 
1 


Sp^ies and 1 ~D . NQ. 


Lane 


fineries and I.D. No. 


Staphylococcus aureus 118 


1 


Staphylococcus saprophytlcus 788 


2 


Staphylococcus aureus 207 


Q 
0 


i m nnp 7 7^ saintoflul 1086 


3 


Staphylococcus aureus 610 


9 


Shigella sonnei 701 


4 


Staphylococcus aureus 812 


10 


Shigella Jboydii 1081 
(ATCC 8700) 


5 


Staphylococcus varneri 793 


11 


Shigella dysenteria 1082 
(ATCC 13313) 


6 


Staphylococcus saprophyticus 762 


12 


Shigella flexneri 1083 
(ATCC 29903) 



Of the 100 strains which were evaluated only one 
strain which was tentatively identified as Hafnia alvei, 
gave a false positive result with the 60-26 and 761rc-26 
primer set. The identity of this false positive is 
5 considered ambiguous because although its ribotyping 

pattern appears to be closer to Hafnia alvei than to the 
genus Salmonella, the strain appears to be biochemically 
closer to Salmonella. The remaining 35 strains of 
Hafnia alvei, which were screened all tested negative 

10 for the presence of the Salmonella test sequence. 

Primer combinations using 3' sites at base positions 59 
or 60 along with complementary strand priming sites 'at 
534 or 53 6 all generated amplification products with at 
least 20% of the negative test panel. Since this rate 

15 of false positives was unacceptable for use in the 

preferred embodiments only the 60-26 and 7 61rc-26 primer 
set was selected for the further evaluation. The 
fragment of Figure 2 flanked and included by these 
primers included nucleic acid bases starting at position 

20 35 and ending at position 786; this is the diagnostic 

target of the invention for Salmonella. Position 35 to 
786 of Sequence ID No. 1 is designated Sequence ID 
No. 21. Position 35 to 786 of Sequence ID No. 20 is 
designated as Sequence ID No. 22. 
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The detection efficiency of the diagnostic marker 
primers 60-2 6 and 761rc-26 primers was then evaluated on 
a test group of over 1480 Salmonella strains. A 
breakdown of the test group by subgenus group and 
5 serotype is shown in Table V. 

TABLE V 



List of S almonella Serotypes Comprising the Test GrOUP 
fcfrfi 60-26 and Primers 



Serot voe / Subaenus 


No. 


Serotvne/Suboenus 


No. 


Salmonella 


abaetetuba F 


3 


Salmonella 


london El 


2 


Salmonella 


adabraka El 


1 


Salmonella 


madelia H 


2 


Salmonella 


adelaide O 


11 


Salmonella 


raanchester C2 


3 


Salmonella 


agaxna B 


2 


Salmonella 


manhatten C2 


5 


Salmonella 


agona O 


25 


Salmonella 


manila E2 


2 


Salmonella 


ajiobo G2 


2 


Salmonella 


mbandaka CI 


14 


Salmonella 


alabama Dl 


2 


Salmonella 


meleagridis £1 


3 


Salmonella 


albany C3 


5 


Salmonella 


minnesota L 


5 


Salmonella 


altendorf B 


2 


Salmonella 


xnississippi G2 


4 


Salmonella 


amsterdaxn £1 


7 


Salmonella 


inonte video CI 


9 


Salmonella 


anatum £1 


42 


Salmonella 


morehead N 


2 


Salmonella 


arechavaleta B 


2 


Salmonella 


muenchen C2 


11 


Salmonella 


arkansas E3 


11 


Salmonella 


muenster El 


10 


Salmonella 


austin CI 


2 


Salmonella 


napoli Dl 


7 


Salmonella 


bare illy CI 


8 


Salmonella 


newbrunswick E2 


5 


Salmonella 


berta Dl 


13 


Salmonella 


newington E2 


1 


Salmonella 


binza E2 


19 


Salmonella 


newport C2 


26 


Salmonella 


blockley C2 


4 


Salmonella 


nyborg El 


2 


Salmonella 


bodjonegoro N 


2 


Salmonella 


Ohio CI 


53 


Salmonella 


braenderup CI 


30 


Salmonella 


oranienburg CI 


8 


Salmonella 


brandenburg B 


9 


Salmonella 


othmarschen CI 


5 


Salmonella 


bredeney B 


14 


Salmonella 


panama Dl 


8 


Salmonella 


California B 


7 


Salmonella 


paratyphi A 


1 


Salmonella 


cerro K 


13 


Salmonella 


poona Gl 


2 


Salmonella 


champaign Q 


2 


Salmonella 


pullorum Dl 


21 


Salmonella 


chandans F 


5 


Salmonella 


reading B 


8 
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Salmonella 


choleraesuis CI 


13 


Salmonella 


redlands I 


2 


Salmonella 


corvallia C3 


6 


Salmonella 


rostock Dl 


2 


Salmonella 


cubana G2 


21 


Salmonella 


rubislaw F 


5 


Salmonella 


daressalaam 8 


1 


Salmonella 


saintpaul B 


10 


Salmonella 


derby B 


8 


Salmonella 


sandiego B 

c 


7 


Salmonella drypool E2 


11 


Salmonella 


Santiago C3 


48 


Salmonella 


dublin Dl 


14 


Salmonella 


schwarzengr. B 


10 


Salmonella 


durham G2 


5 


Salmonella 


sculcoates 


2 


Salmonella ealing O 


3 


Salmonella 


senftenberg E4 


56 


Salmonella 


enteritidis Dl 


124 


Salmonella 


sladun B 


2 


Salmonell a 


eschweiler CI 


2 


Salmonella 


Stanley B 


7 


Salmonell a 


ferlac H 


2 


Salmonella 


Stanleyville B 


3 


Salmonella 


gallinarum 0 


3 


Salmonella 


sya X 


4 




give El 


4 


Salmonella 


tennessee CI 


19 


Salmonella 


haardt 0 


12 


Salmonella 


thomasville E3 


11 


Salmonella 


hadar C2 


17 


Salmonella 


thompson CI 


16 


Salmonella 


havana G2 


15 


Salmonella 


typhi Dl 


2 


Salmonella 


heidelberg B 


20 


Salmonella 


typhimurium B 


97 


Salmonella 


Indiana B 


13 


Salmonella 


urbana N 


2 


Salmonella 


infantis CI 


31 


Salmonella 


virchow CI 


14 


Salmonella 


johannesburg R 


5 


Salmonella 


waycross S 


2 


Salmonella 


kedougou G2 


7 


Salmonella 


worthington G2 


11 


Salmonella 


kentucky C3 


11 


Salmonella 


Group I species 


211 


Salmonella 


kiaxnbu B 


2 


Salmonella 


Group II species 


23 


Salmonella 


krefeld E4 


2 


Salmonella 


Group Ilia species 


39 


Salmonella 


kubacha B 


4 


Salmonella 


Group Hlb species 


19 


Salmonella 


lexington El 


7 


Salmonella 


Group IV species 


16 


Salmonella 


lille CI 


8 


Salmonella 


Group V species 


2 


Salmonella 


livings ton CI 


9 









This pair of priming sites proved to be extremely 
accurate in detecting Salmonella strains from all six 
subgenus groups in the genus Salmonella. The 1390 
strains of Group I Salmonella were detected at an 
5 efficiency of 99.75%. Although the remaining five 
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subgenus groups contained considerably fewer strains , 
the strains comprising all these groups were detected at 
100% efficiency. The detection efficiency of the 60 and 
7 61 priming sites for the individual subgenus groups and 
5 the entire Salmonella test group are shown in Table VI. 

TfiftLF, VI 

Summaxy nf salm sm&lla Detaching Efficiency 

for- thfi 60-26 *nH 761rn-26 Primer Set 



Total Subgenus Group I Tested 1390 

Total Positive 1386.5 
% Positive 99.75 

Total Subgenus Group II Tested 23 
Total Positive 23 

% Positive 100 

Total Subgenus Group Ilia Tested 39 
Total Positive 39 

% Positive 100 

Total Subgenus Group Illb Tested 19 
Total Positive 19 

% Positive 100 

Total Subgenus Group IV Tested 16 
Total Positive 16 

% Positive 100 

Total Subgenus Group V Tested 2 
Total Positive 2 

% Positive 100 

Total Salmonella Tested 1489 

Total Salmonella Positive 1485.5 
% Positive 99.76 
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If a PCR product was visible at < 5 x 10 4 DNA 
copies per reaction the result was scored as +1 . 

If a PCR product was only visible when the DNA copy 
number was > 5 x 10 4 copies per reaction the test was 
5 scored as +0.5. 

A. representative composite showing PCR 
amplification products for the Salmonella strains listed 
below is shown in Figure 5. 

Figure 5 shows the amplification products formed 
10 using the 60-2 6/7 61rc-2 6 primer set as separated by gel 
electrophoresis for samples of 44 Salmonella. The lanes 
are correlated with Salmonella strains as follows: 



Lane 
1 


Species- 


and l.D.-Mo* 


Lane 


<5p*rrles and I.D . No ■ 


Salmonella 


abaetetuba 1550 


7 


Salmonella anatuzn 1501 


2 


Salmonella 


adabraka 2340 


8 


Salmonella anatum 2744 


3 


Salmonella 


agona 1353 


9 


Blank 


4 


Salmonella 


agona 1446 


10 


Salmonella binza 1432 


5 


Salmonella 


agona 2339 


11 


Salmonella binza 2682 


6 


Salmonella 


altendorf 1654 


12 


Salmonella brandenburg 1355 



fi. 



Lane 
1 


Spiles and T . D . No ■ 


T.arte 


Species 


and I -D . No. 


Salmonella enteritidis 706 
(ATCC 6962) 


7 


Salmonella 


hadar 1231 


2 


Salmonella enteritidis 890 


8 


Salmonella 


havana 2245 


3 


Salmonella eschweiler 1647 


9 


Salmonella 


havana 2271 


4 


Salmonella gallinarum 1635 


10 


Blank 




5 


Salmonella gallinarum 2350 


11 


Salmonella 


heidelberg 1238 


6 


Salmonella haardt 1344 


12 


Salmonella 


heidelberg 1239 



Lang Spaces apH T r> No . . Lane SpflC i eS flnfl T , D . NO . 

1 Salmonella Indiana 1480 7 Salmonella kentucky 2195 

2 Salmonella infantis 727 8 Salmonella kentucky 2756 

3 Blank 9 Salmonella kentucky 2759 
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4 Salmonella infant is 1437 

5 Salmonella kedougou 1251 

6 Salmonella kedougou 1254 

I 

Lane Spmig.g anfi T.D. No. 

1 Salmonella typhimurium 1253 

2 Salmonella typhimurium 1499 

3 Salmonella typhimurium 1509 

4 Salmonella urbana 1663 

5 Salmonella virchow 738 

6 Salmonella virchow 1241 



10 Salmonella kentucky 2769 

11 Salmonella kiambu 919 . 

12 Salmonella lexington 1649 

Lane Fpe^jga anH T.n. No. 

7 Salmonella virchow 1256 

8 Salmonella virchow 1370 

9 Salmonella virchow 1431 

10 Blank 

11 Salmonella worthington 2 638 

12 Salmonella vrindaban 2314 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) 



APPLICANT 
(A) NAME: 



E. I. DU PONT DE NEMOURS AND 
COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITY : WILMINGTON 

(D) STATE: DELAWARE 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE (ZIP) : 19898 
TELEPHONE: 302-8 92-8112 
TELEFAX: 302-773-0164 



(G) 
(H) 



(I) TELEX: 6717325 
(ii) TITLE OF INVENTION: 



SELECTION OF DIAGNOSTIC 
GENETIC MARKERS IN 
MICROORGANISMS AND USE 
OF A SPECIFIC MARKER 
FOR DETECTION OF 
SALMONELLA 



(iii) NUMBER OF SEQUENCES: 22 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.50 INCH 
<B) COMPUTER: MACINTOSH 

(C) OPERATING SYSTEM: MACINTOSH, 6,0 

(D) SOFTWARE: MICROSOFT WORD, 4.0 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: GEIGER, KATHLEEN W. 

(B) REGISTRATION NUMBER: 35,880 

(C) REFERENCE/DOCKET NUMBER: MD-1068 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 





CAGCCGCGAG 


GATGATATGG 


ATG TTAGCCG 


GGACGCTTAA 


TGCGGTTAAC 


60 


GCCATGCCGA 


CACCAGCGCC 


CGCCAGCGTG 


CCGAAACTGT 


AGAAACCATG 


CATCATCGGC 


120 


AGAACGGTTT 


TATTCAGCTC 


GCGTTCGACC 


GCCGCGCCTT 


CGACATTAAT 


CGCCACTTCG 


180 


GCGGCGCCAA 


AACTGGCGCC 


GAAAACGGCT 


AATCCAAGGG 


CAAAAATCAG 


CGGCGAGGCG 


240 


CACCACAGCG 


CGACGCTAAG 


AATAACCATC 


CCGGTTACTG 


CACAGGTCAT 


CGTCGTGCGA 


300 


ATAACCTTCC 


GGGTGCCAAA 


TCGTTTCACC 


AGCCAGGCGG 


AACAAAGAAT 


ACCGCTCATT 


360 


GAACCGATAG 


AAAGCCCGAA 


TAAGACCGCC 


CCCATTTCCG 


CGGTAGAGAC 


GGAAAGAATA 


420 


TCCCGAATAG 


CAGGCGTTCG 


GGTTGCCCAG 


GAGGCCATCA 


GCAGTCCGGG 


TAAAAAGAAG 


480 


AACATAAACA 


GCGCCCAGGT 


ACGGCGTTTT 


AAGGCGTTAC 


G7GAGGAGAG 


GACGGTCATA 


540 


GCGTCAGGCC 


AGAAAATAGA 


AGCGAGAGGT 


AAACATTAGC 


AAGCTTGTGT 


ACATTTGTAC 


600 


ATATCATCGT 


CATACTTCAT 


TGTGCAGACA 


GTTTTTACTG 


TCTGTTTTTT 


CAGCGTAAGC 


660 


GGCAGGCTAC 


TATCGCCTGC 


ATCCTGAATG 


AGATGTGGAA 


CTCATCATGA 


AAGAAAATGC 


720 


CGTAAGCGCG 


CCAATGATCC 


TAAGCGACGG 


GAAAAAATAA 


TTCAGGCCAC 


ACTGGAAGCG 


780 


GTAAAGACCT 


ATGGCACTCT 


GCCGTGACTA 


A 






811 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
AGCTGATGCT AC 



WO 95/33854 



PCT/US95/06704 



45 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AGTCGAACTG TC 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TTAGTCACGG CA 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGCGATACCG TA 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTACAGCTGA TG 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTCAGTCGAA CT 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGCATTAGTC AC 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGTATGCGAT AC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GACGCTTAAT GCGGTTAACG CCA 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AACCATGCAT CATCGGCAGA ACG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
AGTAGCCTGC CGCTTACGCT GAA 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TCAGGATGCA GGCGATAGTA GCC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

TTAGCCGGGA CGCTTAATGC GGTTAA 26 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TAGCCGGGAC GCTTAATGCG GTTAAC 2 6 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CTATTTTCTG GCCTGACGCT ATGACC 26 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



TTCTATTTTC TGGCCTGACG CTATGA 



26 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CATTCAGGAT GCAGGCGATA GTAGCC 26 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CTTTACCGCT TCCAGTGTGG CCTGAA 26 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TTAGTCACGG CAGAGTGCCA TAGGTCTTTA CCGCTTCCAG TGTGGCCTGA ATTATTTTTT 60 

CCCGTCGCTT AGGATCATTG GCGCGCTTAC GGCATTTTCT TTCATGATGA GTTCCACATC 120 

TCATTCAGGA TGCAGGCGAT AGTAGCCTGC CGCTTACGCT GAAAAAACAG ACAGTAAAAA 180 

CTGTCTGCAC AATGAAGTAT GACGATGATA TGTACAAATG TACACAAGCT TGCTAATGTT 240 

TACCTCTCGC TTCTATTTTC TGGCCTGACG CTATGACCGT CCTCTCCTCA CGTAACGCCT 300 

TAAAACGCCG TACCTGGGCG CTGTTTATGT TCTTCTTTTT ACCCGGACTG CTGATGGCCT 360 

CCTGGGCAAC CCGAACGCCT GCTATTCGGG ATATTCTTTC CGTCTCTACC GCGGAAATGG 420 
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GGGCGGTCTT ATTCGGGCTT TCTATCGGTT CAATGAGCGG TATTCTTTGT TCCGCCTGGC 480 

TGGTGAAACG ATTTGGCACC CGGAAGGTTA TTCGCACGAC GATGACCTGT GCAGTAACCG .540 

GGATGGTTAT TCTTAGCGTC GCGCTGTGGT GCGCCTCGCC GCTGATTTTT GCCCTTGGAT 600 

TAGCCGTTTT CGGCGCCAGT TTTGGCGCCG CCGAAGTGGC GATTAATGTC GAAGGCGCGG 660 

CGGTCGAACG CGAGCTGAAT AAAACCGTTC TGCCGATGAT GCATGGTTTC TACAGTTTCG 720 

GCACGCTGGC GGGCGCTGGT GTCGGCATGG CGTTAACCGC ATTAAGCGTC CCGGCTAACA 780 

TCCATATCAT CCTCGCGGCT GCCGTGACTA A 811 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



TAGCCGGGAC 


GCTTAATGCG 


GTTAACGCCA 


TGCCGACACC 


AGCGCCCGCC 


AGCGTGCCGA 


60 


AACTGTAGAA 


ACCATGCATC 


ATCGGCAGAA 


CGGTTTTATT 


CAGCTCGCGT 


TCGACCGCCG 


120 


CGCCTTCGAC 


ATTAATCGCC 


ACTTCGGCGG 


CGCCAAAACT 


GGCGCCGAAA 


ACGGCTAATC 


180 


CAAGGGCAAA 


AATCAGCGGC 


GAGGCGCACC 


ACAGCGCGAC 


GCTAAGAATA ACCATCCCGG 


240 


TTACTGCACA 


GGTCATCGTC 


GTGCGAATAA 


CCTTCCGGGT 


GCCAAATCGT 


TTCACCAGCC 


300 


AGGCGGAACA 


AAGAATACCG 


CTCATTGAAC 


CGATAGAAAG 


CCCGAATAAG 


ACCGCCCCCA 


360 


TTTCCGCGGT 


AGAGACGGAA 


AGAATATCCC 


GAATAGCAGG 


CGTTCGGGTT 


GCCCAGGAGG 


420 


CCATCAGCAG 


TCCGGGTAAA 


AAGAAGAACA 


TAAACAGCGC 


CCAGGTACGG 


CGTTTTAAGG 


480 


CGTTACGTGA 


GGAGAGGACG 


GTCATAGCGT 


CAGGCCAGAA 


AATAGAAGCG AGAGGTAAAC 


540 


ATTAGCAAGC 


TTGTGTACAT 


TTGTACATAT 


CATCGTCATA 


CTTCATTGTG 


CAGACAGTTT 


600 


TTACTGTCTG 


TTTTTTCAGC 


GTAAGCGGCA 


GGCTACTATC 


GCCTGCATCC 


TGAATGAGAT 


660 


GTGGAACTCA 


TCATGAAAGA 


AAATGCCGTA 


AGCGCGCCAA 


TGATCCTAAG 


CGACGGGAAA 


720 


AAATAATTCA 


GGCCACACTG 


GAAGCGGTAA 


AG 






752 



(2) INFORMATION FOR SEQ ID NO: 22: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 



CTTTACCGCT 


TCCAGTGTGG 


CCTGAATTAT 


TTTTTCCCGT CGCTTAGGAT 


CATTGGCGCG . 


60 


CTTACGGCAT 


TTTCTTTCAT 


GATGAGTTCC 


ACATCTCATT CAGGATGCAG 


GCGATAGTAG 


120 


CCTGCCGCTT 


ACGCTGAAAA 


AACAGACAGT 


AAAAACTGTC TGCACAATGA AGTATGACGA 


180 


TGATATGTAC 


AAATGTACAC 


AAGCTTGCTA 


ATGTTTACCT CTCGCTTCTA 


TTTTCTGGCC 


240 


TGACGCTATG 


ACCGTCCTCT 


CCTCACGTAA 


CGCCTTAAAA CGCCGTACCT 


GGGCGCTGTT 


300 


TATGTTCTTC 


TTTTTACCCG 


GACTGCTGAT 


GGCCTCCTGG GCAACCCGAA 


CGCCTGCTAT 


360 


TCGGGATATT 


CTTTCCGTCT 


CTACCGCGGA 


AATGGGGGCG GTCTTATTCG 


GGCTTTCTAT 


420 


CGGTTCAATG 


AGCGGTATTC 


TTTGTTCCGC 


CTGGCTGGTG AAACGATTTG 


GCACCCGGAA 


480 


GGTTATTCGC 


ACGACGATGA 


CCTGTGCAGT 


AACCGGGATG GTTATTCTTA 


GCGTCGCGCT 


540 


GTGGTGCGCC 


TCGCCGCTGA 


TTTTTGCCCT 


TGGATTAGCC GTTTTCGGCG 


CCAGTTTTGG 


600 


CGCCGCCGAA 


GTGGCGATTA ATGTCGAAGG 


CGCGGCGGTC GAACGCGAGC 


TGAATAAAAC 


660 


CGTTCTGCCG 


ATGATGCATG 


GTTTCTACAG 


TTTCGGCACG CTGGCGGGCG 


CTGGTGTCGG 


720 


CATGGCGTTA ACCGCATTAA GCGTCCCGGC 


TA 




752 
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WHAT IS CLAIMED IS : 

1. A method of determining whether an unknown 
bacterium is a member of the genus Salmonella, 
comprising analyzing the genomic DNA of said unknown 

5 bacterium to detect the presence of nucleic acid 
Sequence ID No, 1 or Sequence ID No. 20. 

2. The method of Claim 1 wherein said analysis 
comprises the steps of : 

(i) performing a PCR amplification reaction 
10 on the genomic DNA of said unknown 

bacterium using a pair of primers 
comprising a first primer and a second 
primer wherein said first primer has a 
nucleic acid sequence derived from 
15 Sequence ID No. 1 and said second primer 

has a nucleic acid sequence derived from 
Sequence ID No. 20; and 
(ii) detecting the presence of DNA which has 
been amplified by said primer pair of 
20 step (i); 

whereby the presence of amplified DNA at step (ii) 
indicates that said unknown bacterium is a member of the 
genus Salmonella . 

3. The method of Claim 2 wherein at step (i) said 
25 first primer is selected from the group consisting of 

Sequence ID Nos. 14 and 15, and said second primer is 
selected from the group consisitng of Sequence ID 
Nos. 16, 17, 18, and 19. 

4. The method of Claim 2 wherein at step (i) said 
30 first primer is Sequence ID No. 15 and said second 

primer is Sequence ID No. 19. 

5. The method of Claim 1 wherein said analysis 
comprises contacting the genomic DNA of said unknown 
organism with a nucleic acid probe wherein said probe 

35 consists essentially of a nucleic acid sequence which is 
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complimentary to Sequence ID Nos . 1 or 20, or a fragment 
thereof, and further, detecting the presence of said 
hybridized probe. 

6. An isolated nucleic acid fragment having 
5 Sequence ID No. 1 or a fragment thereof. 

7. An isolated nucleic acid fragment having 
Sequence ID No. 20 or a fragment thereof. 

8. An isolated nucleic acid fragment having 
Sequence ID No. 14. 

10 9. An isolated nucleic acid fragment having 

Sequence ID No. 15. 

10. An isolated nucleic acid fragment having 

Sequence ID No. 16. 

11. An isolated nucleic acid fragment having 

15 Sequence ID No. 17. 

12. An isolated nucleic acid fragment having 
Sequence ID No. 18. 

13. .An isolated nucleic acid fragment having 
Sequence ID No. 19. 

20 14. An isolated nucleic acid fragment having 

Sequence ID No . 21 • 

15. An isolated nucleic acid fragment having 
Sequence ID No. 22. 

16. An isolated nucleic acid fragment having 

25 Sequence ID No. 4. 
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Figure 2A 

TTAGTCACGG CAGCCGCGAG GAXGAXATGG ATC ^tItAGCCG GGACGCTTAA TGCGGTTAJ^C 



AAXCAGTGCC GTCGGCGCXC CTACTATACC TACAATCGGC CCTGCGAAXT ACGCCAATTG 



35 CGCAGTCCGG lUTTTlA TCX T3CGCTCTCCA TTTGTAATCG TTCGAACACA TGTAAACATG 



C 



60 



10 GCCATGCCGA CAGCAGCGCC CGCCAGCGTG CCGAAACXGT AGAAACCAXG CATCATCGGC 120 

CGGTACGGCT GXGGTCGCGG GCGGTCGCAC GGCTXTGACA TCTTTGGTAC GTAGTAGCCG 

m 

AGAACGGTTT TATTCAGCTC GCGTTCGACC GCCGCGCCTT CGACATTAAT CGCCACTTCG 180 
TCTTGCCAAA AXAAGTCGAG CGGAAGCTGG CGGCGCGGAA GCTGTAAXTA GCGGTGAAGC 

15 

GCGGCGCCAA AACTGGCGCC GAAAACGGCT AATCCAAGGG CAAAAAXCAG CGGCGAGGCG 240 
CGCCGCGGTT TTGACCGCGG C TTT T GCCGA TTAGGTTCCC GTTTTXAGTC GCCGCTCCGC 

CACCACAGCG CGACGCTAAG AATAACCATC CCGGTTACTG CACAGGTCAT CGTCGTGCGA 300 
20 GTGGTGTCGC GCTGCGATXC TTATTGGTAG GGCCAATGAC GTGTCCAGTA GCAGCAGGCT 

ATAACCTXCC GGGXGCCAAA TCGTTTCACC AGCCAGGCGG AACAAAGAAT ACCGCTCATT 360 
TATTGGAAGG CCCACG G T T T AGCAAAGTGG XCGGTCCGCC TTGTTTCTTA TGGCGAGTAA 

25 GAACCGATAG AAAGCCCGAA TAAGACCGCC CCCATTTCCG CGGTAGAGAC GGAAAGAATA 420 

CTTGGCTATC TTTCGGGCTT ATTCTGGCGG GGGTAAAGGC GCCAXCTCTG C CT TT C T TA T 

TCCCGAAXAG CAGGCGTTCG GGTTGCCCAG GAGGCCATCA GCAGTCCGGG TAAAAAGAAG 480 
AGGGCTTATC GTCCGCAAGC CCAACGGGTC CTCCGGTAGT CGTCAGGCCC ATTTTTCTT C 



AA.CAXAAACA GCGCCCAGGT ACGGCGTTTT AAGGCGTTAC GTGAGGAGAG GA CGGTCAIA, 540 

TTGXATTTGT CGCGGGTCCA TGCCGCAAAA TTCCGCAATG CACTCCTCTC CT 

Seo_."yo toe IU> — — 1 

GCGTCAGGCC A GAAAATAGA AGCGAGAGGT AAACATTAGC AAGCTTGTGT ACATTTGTAC 600 
— 1 



I 



WO 95/33854 



PCT/US95/06704 



4/9 



15 



AXAXCAXCGT CfiTACTTCAT TGTGCAGACA GTTTTTACTG XCTGTTTTTT CAGCGTAAGC 660 
TAXAGTAGCA GXATGAAGTA ACACGTCTGT CAAAfiAXGAC AGACAAAAAA GTCGCATTCG 

GGCAGGCTAC XATCGfCCTGC AXCCTGAATG AGAXGTGGAA CTCATCATGA AAGAAAATGC 720 



CCG ipCGATG AIAGCGGACG TAGGfi.CTTA.Cj TCTACACCTT GAGTAGTACT TTCTTTTACG 

CGTAAGCGCG CCAAXGATCC TAAGCGACGG GAAAAAAXAA TTCAGGCCAC ACTGGAAGCG 780 
GCATTCGCGC GGTTACTAGG ATTCGCTGCC CTTTTTTA TT tAAGTCCGGTG TGACCTTCGC 

GTAAAGACCT ATGGCACTCT GCCGTGACTA A <£— 5s cj u£f>C ^ lQ 1- 
CATTTqTGGfi, XACCGTGAGA CGGCACTGAT T <r> 5*^**0* ID N>0 *2.0 



790 800 810 



Sp^u^c*. *3ZD N>0 ID, 
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FIGURE 4A & 4B 
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FIGURE 4C & 4D 
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FIGURE 5A & 5B 
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